Evaluating Parsing Schemes with Entropy Indicators

نویسندگان

  • Caroline Lyon
  • Stephen Brown
چکیده

This paper introduces an objective metric for evaluating a parsing scheme It is based on Shannon s original work with letter sequences which can be extended to part of speech tag sequences It is shown that this regular language is an inadequate model for natural language but a representation is used that models language slightly higher in the Chomsky hierarchy We show how the entropy of parsed and unparsed sentences can be measured If the entropy of the parsed sentence is lower this indicates that some of the structure of the language has been captured We apply this entropy indicator to support one particular parsing scheme that e ects a top down segmentation This approach could be used to decompose the parsing task into computationally more tractable subtasks It also lends itself to the extraction of predicate argument structure

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards Entropy-Based Requirements Elicitation

This paper presents our views towards constructing the universal parsing technique for the software requirements texts and the requirements elicitation technique based on the output of this parsing process. With the proposed parsing technique, it should be possible to achieve language-independent processing of the requirements texts. Source sentences are treated as systems with words as element...

متن کامل

Entropy of Hybrid Censoring Schemes

A hybrid censoring scheme is a mixture of type I and type II censoring schemes. When $n$ items are placed on a life test, the experiment terminates under type I or type II hybrid censoring scheme if either a pre-fixed censoring time T or the rth (1<=r<=n&nbsp;is fixed) failure is first or later observed, respectively. In this paper, we investigate the decomposition of entropy in both hybrid cen...

متن کامل

Measures and Models

I present an entropy measure for evaluating parser performance. The measure is fine-grained, and permits us to evaluate performance at the level of individual phrases. The parsing problem is characterized as statistically approximating the Penn Treebank annotations. I consider a series of models to “calibrate” the measure by determining what scores can be achieved using the most obvious kinds o...

متن کامل

Measures And Models For Phrase Recognition

I present an entropy measure for evaluating parser performance. The measure is fine-grained, and permits us to evaluate performance at the level of individual phrases. The parsing problem is characterized as statistically approximating the Penn Treebank annotations. I consider a series of models to “calibrate” the measure by determining what scores can be achieved using the most obvious kinds o...

متن کامل

Lookahead Search for Lossy Context-Based Adaptive Entropy Coding

In this paper we motivate the need for lookahead search in a context based entropy coder. An efficient algorithm based on modeling of the context coder as a finite state machine is presented. A key contribution of this paper is the use of the per survivor processing principle (PSP) to enable a lookahead search in scenarios where adaptive entropy coding is used. Our results show that lookahead s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره cmp-lg/9709009  شماره 

صفحات  -

تاریخ انتشار 1997